Abstract
Chronic lymphocytic leukemia (CLL) evolves through progression and relapse. Intra-leukemic epigenetic heterogeneity in DNA methylation (DNAme) has been shown to co-operate with genetic heterogeneity in increasing CLL's evolutionary capacity. However, DNAme constitutes only one layer of epigenetic information. To comprehensively study the epigenetic landscape of CLL and its intra-leukemic heterogeneity, we complemented bisulfite sequencing with RNA-seq and a ChIP-seq compendium of histone modification maps (H3K4me3, H3K27ac, H3K27me3, H3K9me3, H3K36me3) in a cohort of 17 primary CLL and 5 normal B cell samples.
Analysis of H3K4me3 and H3K27ac distribution revealed that CLL is associated with greater enhancer activation than normal B cells, with enhanced NFAT, FOXP1, NFkB, and BACH2 activity, as well as downregulated TP53 activity. 1,013 super-enhancers were differentially regulated in CLL compared with normal B cells, with preferential association for genes critical for lymphocyte proliferation and differentiation (including BCL2, TCF/LEF, and EP400) (Figure A).
Next, we hypothesized that intra-leukemic admixtures of epigenetic states and stochastic epigenetic changes during CLL evolution, may lead to reduced co-ordination between any two layers of epigenetic or transcriptional information. To assess this, we drew on a fundamental metric in information theory - mutual information (MI) - which measures how much can be learned from one variable about another. Normal B cells exhibited highly coordinated transcriptional regulation, resulting in higher pairwise MI compared with CLL samples. For example, we observed a 13% decrease in DNAme-RNAseq MI in CLL relative to normal B cells, suggesting that stochastic changes and intra-leukemic heterogeneity decreases the MI of these two variables when measured at the population level (Figure B).
To extend the evaluation of epigenetic co-ordination beyond mutual information of two variables, we modeled the combinatorial patterns of histone modifications, DNAme, and gene expression using a Dirichlet Process Mixture (DPM), which allows learning de novo the number of combinatorial states. DPM analysis revealed that while in normal B cells the combinatorial chromatin states were generally associated with a uniform transcriptional output, in CLL many of these states were associated with variable expression level. For example, transcription start sites (TSSs) with H3K27me3hi/H3K4me3low/H3K27aclow, that in normal B cells were associated with uniform gene silencing, were 3 times more likely to be associated with gene expression in CLL (P < 0.0001, Figure C).
To validate the greater combinatorial epigenetic complexity in CLL, we applied an orthogonal method by training a multivariate Hidden Markov Model (HMM) in CLL and normal B cells. We identified convergence at 12 states(Figure D). HHM distinguished between CLL and normal B cells, with ~20,000 segments assigned to discrepant states between CLL and normal cells, enriched for TP53 regulated genes. Consistent with greater combinatorial complexity, we observed higher within-state variation in CLL, with active TSSs, transcribed regions, and poised TSSs exhibiting the largest fold-change increase in within-state variability (2.7, 2.5, and 1.4, respectively).
Interestingly, HMM also revealed a CLL specific state (P < 0.0001) marked by intermediate H3K27ac and H3K27me3, which are typically exclusive. This state was >2-fold more common in CLL than normal B cells. CLL cells harbored ~1.6M segments of this novel state often located at non-first introns and LTRs and distal regulatory elements. Interestingly, these segments, repressed by Polycomb in normal B cells, were also more likely to be reactivated in CLL (OR: 1.98; CI: 1.39-2.82; P < 0.0001), putatively enabling higher MYC activity by motif enrichment analysis (Figure D-inset).
Thus, CLL's epigenetic landscape is marked by decreased co-ordination between different layers of the epigenome. Our data suggest that ongoing diversification of the CLL cell population leads to an admixture of cells with diverging epigenetic identities, resulting in incomplete gene silencing by the Polycomb complex, and unexpected co-occurrence of, typically mutually exclusive, activating and repressing histone modifications.
Landau: Pharmacyclics: Membership on an entity's Board of Directors or advisory committees.
Author notes
Asterisk with author names denotes non-ASH members.